N=1:

rank | frequency | n-gram |
---|---|---|
1 | 125440 | -а |
2 | 85291 | -и |
3 | 84006 | -е |
4 | 56661 | -м |
5 | 56199 | -у |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 28773 | -ом |
2 | 18626 | -на |
3 | 17048 | -ма |
4 | 15753 | -ни |
5 | 15596 | -ог |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 8352 | -има |
2 | 7717 | -ном |
3 | 6996 | -них |
4 | 6888 | -ног |
5 | 6850 | -ија |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 3767 | -ског |
2 | 2936 | -ском |
3 | 2647 | -ских |
4 | 2519 | -ност |
5 | 2429 | -ским |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 1848 | -ности |
2 | 1211 | -ација |
3 | 1207 | -ајући |
4 | 1025 | -ације |
5 | 921 | -овића |

The tables show the most frequent letter N-grams at the end of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is to detect suffixes rather than prefixes.
For N=3:
SELECT @pos:=(@pos+1), xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
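
The same counting can be sketched outside the database, which may help when checking the query for several values of N at once. The Python snippet below is only an illustration, not the method used to produce the tables: the function name `top_endings` and the toy word list are hypothetical, and the snippet assumes the caller passes one entry per word type, already filtered the way `w_id>100` filters the `words` table. Ties are broken alphabetically here, whereas the SQL query leaves their order unspecified.

```python
from collections import Counter


def top_endings(words, n, k=5):
    """Return the k most frequent length-n word endings as (rank, count, ending)."""
    # word[-n:] yields the whole word when it is shorter than n,
    # which mirrors the behaviour of RIGHT(word, n) in the query.
    counts = Counter(word[-n:] for word in words)
    # Sort by descending count; break ties alphabetically (an assumption,
    # added only to make the output reproducible).
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [
        (rank, cnt, "-" + ending)
        for rank, (ending, cnt) in enumerate(ranked[:k], start=1)
    ]


# Hypothetical toy input: one entry per word type.
sample_words = ["школа", "вода", "града", "школи", "воде"]

for n in range(1, 4):
    print(n, top_endings(sample_words, n, k=2))
```

Because every list entry counts once, the frequencies are counts of word types ending in the given N-gram, just as `COUNT(*)` over the rows of `words` is in the query.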